Water Research
○ Elsevier BV
Preprints posted in the last 7 days, ranked by how well they match Water Research's content profile, based on 74 papers previously published here. The average preprint has a 0.10% match score for this journal, so anything above that is already an above-average fit.
Nsawotebba, A.; Morunyanga, I.; Nakintu, V.; Kabazzi, J.; Magala, J.; Uragiwenimana, V.; Ssekyondwa, S.; Kasujja, R.; Onywera, H.; Hull, N.; Akejo, D. S.; Dambya, C.; Ikoba, S.; Baraka, V.; Tebeje, Y. K.; Barigye, E.; Cham, F.; Ssewanyana, I.; Nabaasa, H.; Muruta, A.; Olaro, C.; Atwine, D.; Nabadda, S.; Acheng, J. R.
Show abstract
Mass gatherings pose significant public health risks by facilitating the spread of infectious diseases. While wastewater-based surveillance (WBS) has been widely used to monitor pathogens in high-income settings, its use as a practical, multi-pathogen surveillance tool during mass gatherings in low- and middle-income countries remains limited. This study aimed to assess the operational feasibility, epidemiological significance, and public health utility of multi-pathogen WBS during the African Nations Championship (CHAN) football tournament in Uganda. Wastewater surveillance was conducted at Mandela National Stadium during eight match days in August 2025. Moore swabs were deployed at 38 manholes receiving wastewater from different toilet facilities across the stadium to capture representative wastewater samples. Samples were processed using Nanotrap(R) microbiome virus particles to concentrate pathogens, followed by nucleic acid extraction. Samples were analyzed for multiple enteric and respiratory pathogens, including Mpox, using quantitative PCR (qPCR). Descriptive analyses were performed to characterize pathogen detection patterns, positivity rates, and temporal distribution across surveillance sites. A total of 304 wastewater samples were collected and analyzed, of which 259 (85.2%) tested positive for at least one pathogen. Multiple pathogens were consistently detected across sampling days, with enteric pathogens predominating, particularly Shigella spp. (53.6%), Rotavirus A (35.9%) and Enterovirus (32.2%). The mpox virus was also detected in a notable proportion of samples (28.6%) across several sampling days. Respiratory pathogens, including SARS-CoV-2 (11.8%) and Influenza B (8.2%), were identified intermittently at lower frequencies. Pathogen diversity varied over time, with up to eight pathogens detected on a single day, and co-detection of multiple pathogens observed in the majority of positive samples. Cq value distributions further demonstrated variability in detected signal patterns across pathogens. Surveillance findings informed real-time public health interventions, including sanitation reinforcement, intensified hygiene promotion, environmental disinfection, and targeted risk communication, strengthened syndromic surveillance with on-site triage, and targeted environmental health assessments of food handling and wastewater infrastructure. These findings demonstrate the operational feasibility and public health utility of integrating multi-pathogen wastewater-based surveillance into mass-gathering preparedness and response frameworks in low-resource settings. By capturing diverse pathogen signals and informing targeted interventions during the CHAN football tournament, WBS can provide actionable population-level insights that can support outbreak preparedness and response. Scaling WBS within national preparedness systems could strengthen epidemic intelligence, enhance early warning capacity, and support data-driven public health decision-making during future mass gatherings and emerging infectious disease threats.
Whitehill, F.; Lyons, A. K.; Abera, B.; Adler, C.; Burgos-Garay, M.; Campbell, M.; Santiago, A. J.; Ganim, C.; Moore, J.; Cahela, Y.; Lenz, S.; Gable, P.; Medrzycki, M.; Walters, M. S.; Keaton, A.; Cook, P. W.; Li, Y.; Tao, Y.; Zhang, J.; Malapati, L.; Retchless, A. C.; Tong, S.; Williams, M.; Donlan, R.; Coulliette-Salmond, A.
Show abstract
To understand the utility of healthcare facility-level wastewater surveillance (WWS) for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), it is important to correlate wastewater SARS-CoV-2 RNA detection with the number of clinical infections. WWS for SARS-CoV-2 was performed at three skilled nursing facilities (SNFs) over 25 weeks. Electronegative membrane filtration (enMF) and Nanotrap(R) Magnetic Virus Particles (NP) virus concentration methods were compared. Extracts were tested by droplet digital polymerase chain reaction. Spearman's correlations ({rho}) between wastewater virus RNA concentrations and infection counts were calculated. From split wastewater samples, enMF recovered higher SARS-CoV-2 RNA concentrations than NP. Combining data from all facilities, the median concentrations were 53.0 versus 38.6 gc/100 mL for enMF and NP, respectively (p=0.001). Using enMF, correlations were moderate to strong at SNF A ({rho} ranged 0.67 to 0.86, all p-values <0.001). Weak to moderate correlations can be explained by the sampled manhole not representing the entire facility (SNF B, {rho} ranged 0.47 to 0.72, p-values ranged <0.001 to 0.12) and longitudinal data gaps from summer heat and equipment maintenance (SNF C, {rho} ranged 0.14 to 0.59, p-values ranged 0.52 to <0.01). WWS can be a valuable tool for tracking dynamics of SARS-CoV-2 infections in healthcare facilities.
Pujolassos, M.; Kurilshikov, A.; Weersma, R. K.; Yang-Fu, J.; Zhernakova, A.; Calle, M. L.
Show abstract
While microbiome is increasingly recognized as crucial for human health, translating this knowledge into effective healthcare and preventive strategies remains challenging. Many studies focus on identifying changes in microbiome composition associated with disease and evaluating the potential of such disease-associated microbial profiles as biomarkers for disease diagnosis. Under the hypothesis that microbiome dysbiosis may reflect physiological alterations present long before disease onset, in this work, we analyse the potential of disease-specific microbial signatures not as a diagnostic tool when the disease is already present, but as a means of health assessment in the general population. Moreover, instead of trying to define a single health measure, we believe it is necessary to consider several ways in which the microbiome departs from health, according to different disease-related physiological changes. To evaluate our assumptions, we designed a two-stage study: the identification of disease-specific microbial signatures (discovery stage) and, subsequently, the study of their distribution in the general population to assess associations with general health (external validation stage). Specifically, in the discovery phase we characterized 16 disease-specific bacterial signatures from large public microbiome data using a compositional data analysis methodology. In the second phase, we quantified these microbial signatures in the Lifelines-DMP cohort, a large population-based cohort, and evaluated their association with self-reported health status. Results indicate that most disease-specific microbial signatures associate with health status, supporting our assumption that microbial composition can capture physiological alterations before disease onset, and highlighting the importance of considering multiple ways in which microbiome departs from a healthy state. These findings reaffirm the potential of microbial information as an additional tool in preventive medicine.
Jobe, N. I.
Show abstract
Background: Endocrine-disrupting chemicals (EDCs) in consumer products are ubiquitously detected in human biospecimens, yet most epidemiological studies examine single chemicals rather than real-world co-exposures. We evaluated associations between a mixture of seven urinary chemical biomarkers and systemic inflammation. Methods: Survey-weighted log-log regression models adjusted for age, sex, race/ethnicity, poverty-income ratio, and survey cycle were conducted with Benjamini-Hochberg FDR correction (primary analysis, N=4,864). A sensitivity analysis additionally adjusted for body mass index and smoking status (N=4,494). Results: In the primary analysis, 5 of 7 chemicals showed significant associations after FDR correction: ethylparaben ({beta} = -0.056, FDR P < .001), propylparaben ({beta} = -0.026, FDR P = .007), bisphenol A ({beta} = +0.052, FDR P = .005), monoethyl phthalate ({beta} = +0.043, FDR P = .002), and monocyclohexyl phthalate ({beta} = +0.215, FDR P = .007). The WQS mixture index was significantly associated with CRP ({beta} = +0.056, 95% CI [0.031, 0.081], P < .001), with monocyclohexyl phthalate carrying the largest mixture weight (0.342). In the BMI- and smoking-adjusted sensitivity analysis, associations attenuated to null for all chemicals, though MCP preserved direction ({beta} = +0.129) and the WQS mixture direction was maintained ({beta} = +0.018). Two multiple imputation sensitivity analyses confirmed that monocyclohexyl phthalate was the only chemical to maintain a positive direction across all four analytical specifications (primary complete-case, BMI-adjusted complete-case, primary-aligned imputation, and BMI-adjusted imputation), reaching statistical significance in three of four specifications and providing convergent evidence of a robust MCP-inflammation association. Conclusions: The chemical mixture showed a significant collective association with systemic inflammation, consistent with a cumulative pro-inflammatory burden from co-exposure to multiple consumer product chemicals. These findings suggest that regulatory approaches should shift from single-chemical to mixture-based risk assessment frameworks for consumer product safety.
Balogun, W. G.; Zeng, X.; Nafash, M. N.; Sehrawat, A.; Shi, R.; Svirsky, S. E.; Okonkwo, D. O.; Puccio, A. M.; Karikari, T. K.
Show abstract
Brain-derived tau (BD-tau) is an emerging blood-based biomarker for neurodegeneration, yet there are currently limited well validated BD-tau assays available for research and clinical use. To enhance access to this vital biomarker for neurological disorders including traumatic brain injury (TBI), we developed a novel blood-based immunoassay for BD-tau on the ultra-sensitive Quanterix HD-X platform using Single Molecule Array technology. Analytical validation assessed dilution linearity, specificity, precision, detection limits, and spike recovery, each recording robust metrics in agreement with international expert recommendations. The assay demonstrated robust validation metrics, achieving between-run stability of 95% when analyzing aliquots from six independent plasma and serum samples across five analytical runs. It also showed strong dilution linearity when diluted four-fold and achieved over 90% recovery when spiked with cerebrospinal fluid. Next, we evaluated the clinical utility of the assay in cohorts of individuals with traumatic brain injury (TBI), where strong performances were recorded whether using the 2-step or 3-step assay formats ({rho}= 0.94; p < 0.0001). Furthermore, plasma BD-tau distinguished samples from TBI patients based on time from injury and severity (AUC=0.93). Plasma BD-tau differentiated between favorable and unfavorable functional outcomes in the acute-severe group. Our findings underscore the significant potential of the BD-tau assay as a biomarker for TBI in the severe phase.
Fu, B.; DeSchepper, L. B.; Sun, J.; McKeithen-Mead, S. A.; Kapili, B.; Ochoa-Andersen, P.; Spencer, S. P.; Fardeen, T.; Ricardo, M.; El Kamari, V.; Sinha, S.; Relman, D. A.; Grembi, J. A.; Shalon, D.; Estrela, S.; Huang, K. C.
Show abstract
The human small intestine (SI) plays a central role in nutrient processing, host-microbe interactions, and immune regulation, yet remains poorly characterized due to the lack of minimally disruptive sampling methods. Here, we present a protocol for deploying, recovering, and analyzing samples collected using an ingestible device that enables multi-region, lumen-targeted SI sampling during normal digestion. The device incorporates a ~30-cm collapsible tube wound into pH- or time-responsive layers that sequentially unfurl in situ, typically capturing three spatially ordered samples with high yield and reliable retrieval. This protocol outlines study design, participant handling, device recovery, contamination control, and standardized workflows for analyses, including cell quantification, culturomics, sequencing, and metabolomics. We further describe benchmarking approaches for evaluating spatial resolution and strategies for assay prioritization when sample volume is limiting. By reducing participant burden and facilitating integration with stool, saliva, and clinical metadata, this approach enables longitudinal and large-cohort studies linking SI microbial ecology and host physiology to human health.
Morris, H.; Pritt, B. S.
Show abstract
Effective filtration and concentration of stool specimens is an essential pre-analytical step for reducing fecal debris and improving organism recovery using microscopy-based ova and parasite (O&P) examination. This study evaluated three commercially available fecal sedimentation-based filtration/concentration systems, ParaPak SpinCon (Meridian Bioscience), Mini Parasep SF (Apacor), and the newly-available ParadeviceReingenuity), for qualitative parasite detection and workflow logistics using conventional and artificial intelligence (AI)-assisted microscopy. Forty clinical stool specimens (20 parasite-positive and 20 parasite-negative) were processed with the 3 devices, and the resultant 120 wet mount and 120 trichrome stained smear preparations were examined using conventional microscopy. Trichrome-stained slides were also scanned at 40x magnification using a Hamamatsu NanoZoomerS360 flatbed digital slide scanner and images were analyzed using the Techcyte Fusion Human Fecal Trichrome AI algorithm. Positive and indeterminate digital findings were confirmed by conventional glass slide microscopy. Slides and digital images were reviewed in a blinded manner. Concordance was assessed among the 360 initial evaluations (microscopy and AI-assisted), and discrepant parasitology results were resolved through re-review and specimen reprocessing as needed. Final qualitative agreement across slide/image evaluations using all three concentration systems was 100%. Minor discrepancies in protozoan and white/red blood cell detection/identification were noted in 5 and 7 cases, respectively, and likely reflected sampling and observer variability. While the three concentration systems produced equivalent qualitative results, the Paradevice and Mini Parasep SF offered the most streamlined workflows. These findings support the Paradevice and Mini Parasep SF as efficient, analytically equivalent systems that are compatible with traditional and AI-assisted O&P workflows.
Gonzales, M.; Kang, X.; Adamson, M. M.; Chao, S. Z.; Yoon, B. C.
Show abstract
PURPOSE: Alzheimer disease (AD) is associated with cognitive impairment, brain atrophy, and elevated amyloid-beta and tau. The study aimed to characterize regional atrophy associated with elevated amyloid-beta and tau, as measured by [18F]florbetapir (FBP) and [18F]flortaucipir (FTP) positron emission tomography (PET), respectively, and determine whether combining PET and atrophy data improves the prediction of cognitive impairment. METHODS: Alzheimer Disease Neuroimaging Initiative data (n = 381) were retrospectively analyzed. PET results were correlated with cortical thickness, gray matter (GM) volumes, Mini-Mental State Examination, and Montreal Cognitive Assessment. Linear/logistic regression and area under the curve (AUC) were used to evaluate for significant correlations and compare performances in distinguishing cognitive impairment, respectively. RESULTS: Incremental loss of cortical thickness and GM volume was observed from FBP-/FTP- (n = 205) to single PET-positive (FBP+/FTP-, n = 133; FBP-/FTP+, n = 5) and FBP+/FTP+ (n = 38) groups, particularly in the temporal and parietal lobes. FBP+/FTP+ showed the most severe cortical thickness loss in the entorhinal cortex, temporal lobe GM atrophy, and cognitive impairment. Adding brain atrophy as the third variable resulted in higher odds ratios and improved AUCs for cognitive impairment, with FBP+/FTP+/temporal GM or entorhinal cortical atrophy+ demonstrating the strongest associations with cognitive impairment. CONCLUSION: A multimodal approach combining PET and MRI may help improve the assessment of cognitive impairment in AD.
Ernandez, J.; Xiang, L.; Adler, R.; Hsu, J.; Shah, S. K.; Kim, D.; Gershman, B.; Mossanen, M.; Weissman, J. S.
Show abstract
OBJECTIVE: Bladder cancer (BC) is predominantly a disease of older, comorbid adults, and radical cystectomy (RC), which is the gold standard treatment, carries considerable morbidity. We sought to determine the impact of baseline dementia and frailty on the care trajectory beyond the immediate postoperative period. We hypothesized that frail patients and those with dementia undergoing RC for BC will have poorer care trajectories. METHODS AND MATERIALS: We identified Medicare beneficiaries [≥] 66 years old who underwent RC for BC in 2017 with 12 months of pre- and post-RC enrollment. Frailty and dementia were characterized using validated, claims-based measures. Associations between baseline frailty and dementia with postoperative care trajectory outcomes were determined using Fine-Gray competing risk models. RESULTS: We identified 3,600 beneficiaries of whom 11.6% were frail and 3.4% met criteria for dementia. Patients with dementia were more likely to be frail, comorbid, and not receive standard-of-care neoadjuvant chemotherapy. Frailty was independently associated with [≥] 2 transitions in care level after index discharge from RC and skilled nursing facility (SNF) admissions within 1 year of RC, exposure to intensive post-RC interventions, including dialysis and feeding tube placement, and poorer survival. Dementia remained associated with SNF admissions regardless of frailty level. CONCLUSIONS: Among a contemporary cohort of older adults undergoing RC for BC, preoperative dementia and frailty were independently associated with poorer care trajectory beyond the immediate postoperative period after RC. Our work highlights a role for preoperative geriatric assessment in identifying and optimizing patients at greatest risk.
shao, w.; Ammerman, B.; Jacobucci, R.
Show abstract
Suicidal risk may be encoded in everyday communication patterns but diluted in routine digital interactions. We introduce a method for surfacing this latent signal: training per-person language model agents on individuals' authored text (the on-screen text each participant typed, captured whenever a keyboard was visible in screenshots) and placing those agents in simulated social interactionsa communicative stress test. Using data from 79 adults with recent suicidal ideation, we ne-tuned individual LoRA adapters on Qwen3-8B using each participant's authored text, then placed agents in standardized conversations with probe personas. Agent-generated risk language was associated with EMA-measured suicidal ideation (r= .576, p < .001), with a single neutral small-talk probe performing nearly as well (r= 551). A shue control conrmed the signal is person-specic (r= .071 when adapters were mismatched), and automated descriptions of participants' general smartphone activity produced no signal, conrming specicity to interpersonal communication. A prompt ablation demonstrated partial robustness to removal of disclosure-encouraging language (r = .430). This proof-of-concept demonstrates that simulated social interaction can amplify latent vulnerability signals, bridging digital phenotyping, generative AI, andsuicide theory.
Beer, S.; Simpkin, A. J.; Eldeeb, S. Y.; Zar, H. J.; Stein, D. J.; Dunn, E. C.; Smith, A. D. A. C.
Show abstract
Background: In prospective cohort studies, where an exposure is collected repeatedly, interest often lies in determining whether the timing of that exposure has a differential effect on a later outcome. The Structured Life Course Modeling Approach (SLCMA), where users select between temporal hypotheses of exposure specified a priori, provides one way to analyse such longitudinal data. However, few studies using SLCMA consider the effect of time-varying covariates (TVC) which may impact associations. Methods: We present a modified version of the SLCMA - called direct and mediated effects (DME)-SLCMA - which corrects for TVC. We first develop the DME-SLCMA method, test it through simulation, and apply it to psychosocial data from the Drakenstein Child Health Study (DCHS, n=336) to investigate relationships between maternal psychopathology, TVC of socioeconomic status, and offspring depressive symptoms. Results: We found that, on average, offspring depressive symptoms score increased by 3.9% (95% CI: 1.0%-6.9%, p = 0.039) for each unit of maternal psychopathology (SRQ) at 48 months whilst adjusting for time-varying socioeconomic status (at 18, 30, 42 and 54 months). Our simulations identified several realistic scenarios where selections ignoring TVC - with TVC mediated exposure effects present - were prone to be incorrect, including our DCHS example. Conclusion: DME-SLCMA is a robust new approach for life course modelling in the presence of time-varying covariates. We recommend adjusting for TVC whenever possible, and, when not possible, our simulation study identified that scenarios where mediated effects are comparable, or greater, in magnitude to direct effects are most prone to confounding.
Hu, L.; Bass, M.; Patridge, E.; Molusky, M.; Antoine, G.; Vuyisich, M.; Banavar, G.
Show abstract
Background: Chronic diseases and symptom syndromes often develop after prolonged biological changes that may precede formal diagnosis. RNA-based metatranscriptomics captures active microbial and human gene expression and may provide a functional layer for disease risk evaluation. To address this translational gap, we developed and validated a Disease Risk Score (DRS) framework that integrates metatranscriptome-derived pathway activity scores from stool, saliva, and blood samples, and evaluated its potential clinical utility as an adjunct risk-evaluation tool. Methods: DRS uses disease-specific sets of pathway activity scores derived from stool and saliva microbial functions, stool and saliva microbial taxa, and blood human gene expression. For each disease, 'not optimal' pathway scores are aggregated into a normalized cumulative odds ratio, or cOR, using score-level odds ratios, statistical significance, and literature-supported biological relevance derived from a Development Cohort of 22,369 individuals. A cOR [≥] 5 is defined as high risk. Performance is evaluated in an independent Validation Cohort of 15,908 individuals using self-reported diseases as the reference. Disease support requires both significant cOR separation between self-reported and not-reported (Cohen's d [≥] 0.2) and risk ratio enrichment of self-reported disease among individuals classified as high risk (95% CI of Risk Ratio > 1). Results: Of 20 initially evaluated diseases, 15 meet the prespecified validation criteria on the independent validation cohort: ADHD, anxiety, chronic fatigue syndrome, depression, GERD, hypertension, inflammatory bowel disease, IBS-C, IBS-D, insomnia, MASLD, obesity, obstructive sleep apnea, Sjogren's syndrome, and type 2 diabetes. Five selected clinical scenarios illustrate how DRS can support clinician-mediated decision making, including IBS subtype reclassification, improved diagnostic acceptance in IBS-D, personalized lifestyle counseling in MASLD and early type 2 diabetes, and diagnostic uncertainty in atypical GERD. Conclusions: DRS is a metatranscriptomics-based risk-stratification framework that aggregates active microbial and human pathway signals into interpretable disease-specific risk estimates across a wide range of disease conditions. Validation against self-reported disease labels in an independent cohort shows significant risk enrichment for each of 15 diseases. DRS is intended as an adjunct to clinical evaluation: a decision support tool in situations where routine care encounters uncertainty, delay, or low patient engagement. Future prospective studies using clinically adjudicated endpoints are needed to assess calibration and clinical outcomes.
Cantrell, L.; Karampatsas, K.; Andrews, N.; Beach, S.; Bentley, E.; Berardi, A.; Bijlsma, M. W.; Cagil Kocana, C.; Daniel, O.; French, N.; Hall, T.; Izu, A.; Khalil, A.; Kwatra, G.; Kyohere, M.; Madhi, S. A.; Mboizi, R.; Miselli, F.; Nielsen, M.; Thorn, N.; van de Beek, D.; Walker, K.; Heath, P. T.; Le Doare, K.; Voysey, M.; PREPARE WP3 Study Group,
Show abstract
Vaccines to prevent infant group B streptococcus (GBS) disease are advancing, with licensure likely based on safety and immunologic endpoints rather than clinical efficacy data. This approach requires robust, generalisable serological thresholds of risk reduction (SToRRs). We combined data from six case-control studies in Europe and Africa to define SToRRs for early-onset (EOD) and late-onset (LOD) GBS disease. Across diverse epidemiological and healthcare settings, anti-capsular polysaccharide IgG concentrations were consistently higher in infants who remained disease free than in those who developed disease. Higher antibody concentrations were required to reduce the risk of EOD than LOD, and higher concentrations were required for serotype Ia than for serotype III. This study provides a quantitative framework to support correlates-based evaluation and potential licensure of maternal GBS vaccines.
Juhasz, J.; DeFeis, B.; Britton, M. K.; Hoogerwoerd, H.; Worwag, K.; Johnson, K. J.; Uribe, A.; Williamson, J. B.; Porges, E. C.; Cohen, R. A.
Show abstract
Introduction: Brain-predicted age, estimated from structural MRI data, is a machine-learning biomarker of biological brain aging. Greater brain age gap (BAG) indicates advanced brain aging and is associated with cognitive decline and mortality. Cardiometabolic risk factors, including elevated blood glucose, body mass index (BMI), blood pressure, and cholesterol, increase risk of cognitive impairment and dementia in aging. Their relationship with BAG in severe obesity remains poorly characterized despite increased prevalence of cardiometabolic risk factors among this population. Methods: T1-weighted MRI data from 97 adults (BMI 35-73) were used to calculate BAG using ENIGMA and Pyment brain age models. Associations between BAG and HbA1c, BMI, hypertension, and hyperlipidemia were examined using multiple linear regression and MM-estimation robust regression, adjusting for age, sex, and race. Post hoc analyses stratified models by clinical HbA1c cutoffs (normoglycemic, prediabetic, diabetic). Results: Higher HbA1c was associated with greater BAGENIGMA (B = 1.58, p = .014) and BAGPyment (B = 0.93, p = .013) in linear regression models. In robust models, HbA1c remained significantly associated with BAGENIGMA (B = 1.70, p = .002) but not BAGPyment (B = 0.71, p = .13). BMI, hypertension, and hyperlipidemia were not associated with BAG in either linear or robust models. HbA1c was associated with greater BAGENIGMA (B = 2.15, p = .01) and BAGPyment (B =1.21, p = .04) in those at or above prediabetic levels and with BAGENIGMA (B = 2.49, p = .047) in those with diabetes. Conclusions: Elevated HbA1c is associated with accelerated brain aging in individuals with severe obesity. BAG was not associated with BMI, hypertension, and hyperlipidemia, which may reflect the restricted BMI range inherent to the sample with severe obesity.
Gobeil, E.; Bourgault, J.; Enault, M.; Cote, V.; Mitchell, P. L.; Ruel, L.-J.; Girard, A. S.; Vohl, M.-C.; Arsenault, B. J.
Show abstract
Metabolic dysfunction-associated steatotic liver disease (MASLD) is rapidly increasing worldwide, yet effective targeted therapies remain limited. To better understand the molecular mechanisms underlying MASLD, we performed an integrated proteogenomic analysis of human liver tissue. Using mass spectrometry, we quantified 2,744 proteins in 504 liver biopsies from the Quebec Obesity Biobank and examined changes across disease stages. To investigate causality, we integrated liver proteomics with RNA sequencing and genome-wide genotyping to map thousands of protein quantitative trait loci (pQTLs) and expression quantitative trait loci (eQTLs). These molecular data were combined with summary statistics from a meta-analysis of genome-wide association studies including 16,532 MASLD cases and 1,240,188 controls. Mendelian randomization and genetic colocalization analyses revealed that most proteins differentially expressed across MASLD stages were not causally implicated in disease risk, whereas several genetically predicted liver proteins showed evidence of causal effects. Among these, higher hepatic levels of the MTARC1 protein were causally associated with MASLD and hepatic fat accumulation. Phenome-wide analyses suggested that MTARC1 inhibition may reduce the risk of cirrhosis, hepatocellular carcinoma, and cholelithiasis while improving lipid profiles. Notably, the causal MTARC1 variant influenced liver protein levels but not gene expression. Genetic analyses also identified ERLIN1 and HSD17B13 as potential therapeutic targets. In contrast, eQTLs and pQTLs at other loci such as GCKR showed opposite effects on MASLD risk. These findings highlight the importance of integrating tissue proteomics with human genetics to distinguish biomarkers from causal drivers and to identify promising therapeutic targets for MASLD.
Chen, M.; Li, X.; Yang, K.; Taramasso, M.
Show abstract
**Abstract** **Background:** Transcatheter edge-to-edge repair (TEER) is an established treatment for mitral regurgitation but remains highly dependent on operator experience and complex transesophageal echocardiography (TEE)-guided intraprocedural imaging. Artificial intelligence (AI)-based semantic segmentation may improve procedural reproducibility and intraprocedural guidance; however, no TEER-specific segmentation framework has been reported. **Objectives:** To develop and evaluate AutoClip, a clinician-driven AI-guided TEE semantic segmentation model designed for simultaneous delineation of mitral valve anatomy and in-vivo TEER device components. **Methods:** A retrospective proof-of-concept study was conducted using 987 intraprocedural TEE frames derived from 10 video clips in 3 patients undergoing MitraClip G4 implantation. Seven semantic labels, including mitral leaflets and device components, were manually annotated using ITK-SNAP. Following standardized preprocessing and region-of-interest extraction, an Attention U-Net architecture was trained frame-wise on bicommissural and corresponding X-plane TEE views. Model performance was assessed using mean intersection-over-union (IoU) and Dice coefficient on an independent test set. **Results:** The Attention U-Net demonstrated improved sensitivity to small device structures compared with conventional U-Net architectures. Preliminary training performance achieved a mean IoU of approximately 0.93, while independent test performance reached a mean IoU of 0.46 across foreground classes. Qualitative assessment demonstrated feasible simultaneous segmentation of mitral leaflets, clip arms, grippers, and delivery shaft during TEER procedures. **Conclusions:** AutoClip represents a proof-of-concept TEER-specific TEE semantic segmentation framework initiated through a clinician-oriented workflow without formal computer science expertise. Although preliminary accuracy remains modest due to limited sample size, this study establishes a reproducible pathway for future AI-assisted intraprocedural guidance systems and larger multicenter development efforts in structural heart interventions.
Geoly, A.; McCalley, D. M.; Struckmann, W.; Azeez, A.; Wong, B.; Kim, B.; Ninomiya, S.; Ahmed, S.; Kim, J. P.; McRae-Clark, A. L.; Froeliger, B.; Sahlem, G. L.
Show abstract
Background: Repetitive Transcranial Magnetic Stimulation (rTMS) is a promising treatment across addictive disorders including Cannabis Use Disorder (CUD). Targeting incentive-salience circuitry via the ventromedial prefrontal cortex (vmPFC) and central-executive circuitry via the left dorsolateral prefrontal cortex (LDLPFC) are both promising treatment approaches; however, to date structural targets have predominated whereas functional targeting may allow for more precision. In this pilot trial we adapted a functional Magnetic Resonance Imaging (fMRI) Regulation of Craving (ROC) task to generate fMRI-based rTMS targets in the vmPFC and LDLPFC. Methods: We recruited treatment-seeking participants with moderate or severe CUD as a part of an open-label trial and administered an adapted ROC-task during fMRI following 24-hours of cannabis abstinence. We identified sub-portions of maximal activation of the LDLPFC when participants thought of long-term consequences of cannabis use (Later) and of the vmPFC when participants thought of short-term positive aspects of cannabis use (Now). We hypothesized that our task would generate acceptable rTMS targets in >66% of baseline fMRI scans. Results: A total of 20-participants enrolled in the trial (50%F, age=33.3+9.8) and completed the baseline fMRI. The adapted ROC-task elicited group level activation in the LDLPFC and precuneus in the Later>Now and in the bilateral vmPFC, ACC, and striatum in the Now>Later contrast. Acceptable functional targets resolved in both the vmPFC and LDLPFC in 19 of 20 participants (one participant did not tolerate MRI). Conclusions: The adapted ROC-task elicits activation in incentive salience and central executive circuitry and can feasibly generate rTMS targets when using a cluster selection algorithm.
Schwoebel, J.; Semenec, I.; Rousseva, J.; Frasch, M. G.; Thorstenson, R.; Bhatt, M.
Show abstract
Large language models embedded in autonomous agents process trusted instructions and untrusted data in one context window, leaving them open to direct and indirect prompt injection. In healthcare this is not hypothetical: a 2025 JAMA Network Open study found commercial medical LLMs followed injected instructions in 94.4% of simulated patient encounters, including life threatening recommendations . Yet the clinically decisive problem we quantify here is different. Most real clinical threats protected health information PHI exfiltration, cross patient access, bulk export, out of scope advice are fluent, legitimate looking requests that carry no attack signal, so even a state of the art injection detector passes them. Existing runtime guardrails trade safety against latency: model based auditors are accurate but add hundreds of milliseconds of Python inference, while lexical filters are fast but blind to obfuscated or semantically disguised payloads. We present QFIRE, an inline, provider agnostic prompt firewall implemented as a single self contained Rust toolchain proxy, CLI, and benchmark harness. QFIRE combines three mechanisms: (i) positive security scope constraints, which restrict a model call to a declared natural language purpose and block out of scope drift even when no overt attack token is present; (ii) an asynchronous detector graph that runs N rules and their detector nodes concurrently, cheapest checks first; and (iii) a de obfuscation pass that decodes Base64 hex ROT13, folds homoglyphs and leetspeak, and strips zero width characters before detection. QFIRE ships 106 versioned firewall rules and a dedicated HIPAA Safe Harbor 18 identifier PHI panel, and runs a local DeBERTa v3 injection classifier via embedded ONNX Runtime. On 1968 public prompt injection and jailbreak prompts QFIREs deterministic hybrid attains F1 0.86, statistically tied with Metas state of the art PromptGuard 2 0.86 and above protectai DeBERTa v3 0.83; lexical baselines lag 0.16 to 0.50. Our central result is on QFIRE HealthBench, a new 2000 prompt healthcare benchmark we build and release with real garak and Microsoft PyRIT payloads. There the same PromptGuard-2 recovers only 0.40 recall DeBERTa v3 0.57, because most clinical threats carry no injection signal; QFIREs combined scope plus PHI chain reaches 0.83 recall F1 0.87 at a calibrated 0.08 false positive rate. Generic injection detection, even state of the art, is therefore necessary but not sufficient for healthcare agents. A bare LLM judge also closes most of this static corpus gap F1 0.90; QFIREs contribution beyond static accuracy is auditable determinism, bounded latency, and adaptive robustness, where the bare judge falls to 34 to 59% recall section 5.5. End to end, placing QFIRE in front of a tool using agent over a mock EHR sandbox cuts the agents harmful action rate from 0.38 to 0.00 at a 0.13 benign utility cost. All code, rules, corpora snapshots, and scripts are released, and every table regenerates from a single make paper target against local models with no paid API keys.
Zheng, Y.; Feng, B.; Cheng, R.; Qiu, C.; Long, Z.; Vaziri, K.; Hahn, J.
Show abstract
Accurate assessment of body composition is important to risk stratification and management of metabolic, musculoskeletal, and aging-related diseases, yet reference modalities such as Dual-energy X-ray absorptiometry (DXA) are costly and impractical for frequent monitoring. Commodity 3D body scans offer a low-cost, radiation-free alternative, but extracting meaningful and predictive shape features from scans remains challenging due to nonuniform point density, variable body size and cross-device differences. We introduce BodyMAE, a self-supervised, surface-area aware masked autoencoder for metric-scale 3D body scans. The pipeline integrates area-adjusted sampling, a long-range focused encoder, and a lightweight decoder regularized to promote locally uniform reconstructions. Trained and evaluated on 917 paired 3D body scans paired with clinical DXA reports, BodyMAE achieves strong accuracy on fat percentage (root-mean-square error (RMSE) 3.825 percentage points, R^2 0.908), fat mass (RMSE 3.694 kg, R^2 0.968), and lean mass (RMSE 3.608 kg, R^2 0.901), with competitive performance on bone mineral content (RMSE 0.284 kg, R^2 0.754).We also assess feature stability across pretrained baselines, finding higher retrieval accuracy for our representations (Top-1 90.131%). These results indicate that combining metric-aware sampling, long-range relational encoding, and local geometric regularization enables accurate body composition estimation from 3D body scans, as validated by comparisons to DXA-derived measurements.
Collier, A.
Show abstract
Background Electronic health record documentation patterns may reflect workflow complexity, monitoring intensity, and operational strain in intensive care settings. However, documentation-derived features can be sensitive to local documentation culture, data capture systems, and outcome definitions. Retrospective validation across multiple datasets is therefore needed before these signals are used in workflow intelligence or clinical AI governance tools. Objective To evaluate whether documentation-density and documentation-timing features show reproducible retrospective signal for ICU workflow complexity and long-stay proxy outcomes across de-identified critical care datasets, while distinguishing workflow and long-stay associations from unsupported claims about mortality prediction, burden reduction, or deployment readiness. Methods We synthesized retrospective validation results from de-identified ICU and workflow datasets generated through a prespecified documentation-density validation program. Feature families included Documentation Burden Score style features, Shift-End Documentation Rate style features, documentation reliability style metadata, and all-documentation feature sets where available. Outcomes included long ICU length of stay proxies, mortality where available, and workflow proxy endpoints. Models compared baseline feature sets with enhanced models containing documentation-density or workflow features. Performance was summarized using area under the receiver operating characteristic curve, Brier score where reported, delta AUROC, bootstrap confidence intervals where reported, and label-shuffle controls where available. Results The strongest external long-stay proxy evidence came from the NWICU chartevents analysis, which included 28,612 ICU stays, 20,267 stays with chart events, and 9,619,759 chart events. For ICU length of stay greater than the median, baseline AUROC was 0.5252. Enhanced AUROC was 0.9512 for Documentation Burden Score features, 0.9214 for Shift-End Documentation Rate features, 0.8470 for documentation reliability style features, and 0.9517 for all documentation features. Corresponding label-shuffle enhanced AUROCs were near random, ranging from 0.4897 to 0.5064. For ICU length of stay greater than the 75th percentile, baseline AUROC was 0.5155. Enhanced AUROC was 0.9433 for Documentation Burden Score features, 0.9194 for Shift-End Documentation Rate features, 0.8118 for documentation reliability style features, and 0.9427 for all documentation features, with label-shuffle enhanced AUROCs from 0.4836 to 0.4999. Additional retrospective support was observed in eICU workflow analyses, HiRID first-24-hour documentation-density analyses, MIMIC-IV HF ICU internal analyses, MIMIC-IV-Note metadata extensions, and nursing-chart or lab density proxy analyses. However, cross-institution discrimination transfer was weak without recalibration, and several analyses remained proxy validations rather than final clinical validations. Conclusions Documentation-density and documentation-timing features show promising retrospective signal for ICU workflow complexity and long-stay proxy outcomes, especially in NWICU chartevents and selected internal dataset-specific analyses. These findings support further preregistered, prospective, silent-mode validation of documentation-derived workflow intelligence. They do not establish prospective clinical performance, mortality reduction, clinician burden reduction, autonomous deterioration prediction, or deployment readiness.